Augment and Reduce: Stochastic Inference for Large Categorical Distributions
Authors
Abstract
Categorical distributions are ubiquitous in machine learning, e.g., in classification, language models, and recommendation systems. They are also at the core of discrete choice models. However, when the number of possible outcomes is very large, using categorical distributions becomes computationally expensive, as the complexity scales linearly with the number of outcomes. To address this problem, we propose augment and reduce (A&R), a method that reduces this computational cost. A&R uses two ideas: latent variable augmentation and stochastic variational inference. It maximizes a lower bound on the marginal likelihood of the data. Unlike existing methods, which are specific to softmax, A&R is more general and applies to other categorical models, such as multinomial probit. On several large-scale classification problems, we show that A&R provides a tighter bound on the marginal likelihood and has better predictive performance than existing approaches.
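The linear cost mentioned in the abstract, and how subsampling can avoid it, can be illustrated with a minimal sketch. It assumes the standard random-utility view of softmax, in which outcome k has a score psi[k] plus standard Gumbel noise, so that the log-likelihood augmented with a noise value eps contains a sum of log-CDF terms over all competing outcomes. The function names, the Gumbel choice, and the uniform subsampling scheme are illustrative assumptions, not the authors' implementation.

```python
import math
import random


def gumbel_log_cdf(x):
    # Log of the standard Gumbel CDF, which is exp(-exp(-x)).
    return -math.exp(-x)


def full_sum(psi, k, eps):
    # Exact sum over all K-1 competing outcomes: cost grows linearly with K.
    return sum(
        gumbel_log_cdf(eps + psi[k] - psi[j])
        for j in range(len(psi))
        if j != k
    )


def subsampled_sum(psi, k, eps, n_samples, rng):
    # Unbiased estimate of full_sum from n_samples competitors drawn
    # uniformly without replacement; only n_samples terms are evaluated.
    n_others = len(psi) - 1
    picks = rng.sample(range(n_others), n_samples)
    # Map positions 0..K-2 onto outcome indices that skip the observed k.
    batch = [i if i < k else i + 1 for i in picks]
    scale = n_others / n_samples
    return scale * sum(gumbel_log_cdf(eps + psi[k] - psi[j]) for j in batch)


if __name__ == "__main__":
    rng = random.Random(0)
    psi = [rng.gauss(0.0, 1.0) for _ in range(10000)]  # hypothetical scores
    print(full_sum(psi, 3, 0.5))
    print(subsampled_sum(psi, 3, 0.5, 100, rng))
```

Averaged over many random subsets, the subsampled value matches the exact sum, which is what makes this kind of term usable inside a stochastic optimization loop.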
Similar resources
Deterministic Annealing for Stochastic Variational Inference
Stochastic variational inference (SVI) maps posterior inference in latent variable models to nonconvex stochastic optimization. While they enable approximate posterior inference for many otherwise intractable models, variational inference methods suffer from local optima. We introduce deterministic annealing for SVI to overcome this issue. We introduce a temperature parameter that deterministic...
Bayesian inference about odds ratio structure in ordinal contingency tables
When the goal of a study is to compare two groups on an ordinal categorical scale, a large number of inferential methods are available. Most methods are designed to detect a location effect, such as by focusing a single-degree-of-freedom test on an effect parameter. Often, rather than merely summarizing by a P-value to describe the evidence against a null hypothesis, it is of interest to consi...
How to use the catnet package
The R package catnet provides an inference framework for categorical Bayesian networks. Bayesian networks are graphical statistical models that represent causal dependencies between random variables. A Bayesian network has two components: a Directed Acyclic Graph (DAG) with nodes representing random variables and a probability structure specified by conditional distributions, one for each node ...
Bayesian Analysis of Stochastically Ordered Distributions of Categorical Variables
This paper considers a finite set of discrete distributions all having the same finite support. The problem of interest is to assess the strength of evidence produced by sampled data for a hypothesis of a specified stochastic ordering among the underlying distributions and to estimate these distributions subject to the ordering. We present a Bayesian approach alternative to the use of the posterior ...
Copula variational inference
We develop a general variational inference method that preserves dependency among the latent variables. Our method uses copulas to augment the families of distributions used in mean-field and structured approximations. Copulas model the dependency that is not captured by the original variational distribution, and thus the augmented variational family guarantees better approximations to the poste...
Journal title:
- CoRR
Volume abs/1802.04220, Issue
Pages -
Publication date 2018